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Abstract 



Given a capacitated graph G — {V, E) and a set of terminals K Q V, how should we produce a graph H 
only on the terminals K so that every (multicommodity) flow between the terminals in G could be supported 
in H with low congestion, and vice versa? (Such a graph H is called aflow-sparsifier for G.) What if we want 
// to be a "simple" graph? What if we allow // to be a convex combination of simple graphs? 

Improving on results of Moitra [FOCS 2009] and Leighton and Moitra [STOC 2010], we give efficient 
algorithms for constructing; (a) a flow-sparsifier H that maintains congestion up to a factor of O(j^^^j), 
where k - \K\. (b) a convex combination of trees over the terminals K that maintains congestion up to a factor 
of (9(log k). (c) for a planar graph G, a convex combination of planar graphs that maintains congestion up to 
a constant factor This requires us to give a new algorithm for the 0-extension problem, the first one in which 
the preimages of each terminal are connected in G. Moreover, this result extends to minor-closed families of 
graphs. 

Our improved bounds immediately imply improved approximation guarantees for several terminal-based 
cut and ordering problems. 
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1 Introduction 



Given an undirected capacitated graph G = (V, E) and a set of terminal nodes K cV,we consider the question of 
producing a graph H only on the terminals K so that the congestion incurred on G and H for any multicommodity 
flow routed between terminal nodes is similar. Often, we will want the graph H to be structurally "simpler" than 
G as well. Such a graph H will be called aflow-sparsifier for G; the loss (also known as quality) of the flow- 
spai^sifier is the factor by which the congestions in the graphs G and H differ. For instance, when K = V, the 



results of [ ]Rac08| ] give a convex combination of trees H with a loss of 0{logn). We call this a tree-based flow- 
sparsifier — it uses a convex combination of trees. ^ Here and throughout, k - \K\ denotes the number of terminals, 
and n = |V| the size of the graph. 

For the case where K V, it was shown by Moitra | MoiOS ] and by Leighton and Moitra [^MIO] that for 



every G and K, there exists a flow-sparsifier H = {K,Eh) whose loss is 0{ 



), and moreover, one can 



lo ^loglog/."' 

efficiently find an H' = (K, Eh') whose loss is 0{^^^). They used these to give approximation algorithms for 
several terminal-based problems, where the approximation factor depended poly-logarithmically on the number 
of terminals k, and not on n. We note that they construct an arbitrary graph on K, and do not attempt to directly 



obtain "simple" graphs; e.g., to get tree-based flow-sparsifiers on K, they apply [ p^ac08[ ] to H', and increase the 
loss by an C?(log k) factor. 

In this paper, we simplify and unify some of these results: we show that using the general framework of in- 



terchanging distance-preserving mappings and capacity-preserving mappings from [Rac08] (which was reinter 



preted in an abstract setting by Andersen and Feige QAFOQQ ), we obtain the following improvements over the 
results of poiOQl LMlOll.^ 



1. We show that using the 0-extension results []CKR04|, |FHRT03| ] in the framework of [ ]Rac08| , |AF09| ] almost 
immediately gives us efficent constructions of flow-sparsifiers with loss O(j^j^j^). While the existential 
result of [^mT^ also used the connection between 0-extensions and flow sparsifiers, the algorithmically- 
efficient version of the result was done ab initio, increasing the loss by another C?(log k) factor. We use 
existing machinery, thereby simplifying the exposition somewhat, and avoiding the increased loss. 



2. We then use a randomized tree-embedding due to [GNRIO], which is a variant of the so-called FRT tree 



embedding [ FRT04 1 where the expected stretch is reduced to C?(log k) by requiring the non-contraction 



condition only for terminal pairs. Using this refined embedding in the framework of | |Rac08| , |AF09| ], we 
obtain efficient constructions of tree-based flow-sparsifiers with loss 0(log k). 

We then turn to special families of graphs. For planar graphs, we give a new 0-extension algorithm that 
outputs a convex combination of 0-extensions f : V ^ K (with f(x) = x for all x € K), such that all 
the corresponding 0-extension graphs Hf = (K,Ef) (namely, Ef = {(/(«), /(v)) : (m,v) € £")) are planar 
graphs, and its expected stretch max„ vgy Bytf/(/(")./W)]/jG(«,v) < 0(1). In particular, the planar graphs Hf 
produced are graph- theoretic minors of G. We remark that the known 0-extension algorithms [CKR04, 
AFH+04| , |LN05| ] do not ensure planarity of Hf. 



It follows that planar graphs admit a planar-based flow-sparsifier (i.e., which is a convex combination of 
capacitated planar graphs on vertex-set K) with loss 0(1), and that we can find these efficiently. The fact 
that flow-sparsifiers with this loss exist was shown by [ LMlOj ], but their sparsifiers are not planar-based. 

Moreover, the 0-extension algorithm itself can be viewed as a randomized version of Steiner point removal 
in metrics: previously, it was only known how to remove Steiner points from tree metrics with 0(1) dis- 
tortion [ pupOl |. We believe this randomized procedure is of independent interest; e.g., combined with an 



'Given a class T of graphs, we define an !F-flow-sparsifier to be a sparsifier that uses a single graph from f and an !F-based flow 
sparsifier to be a sparsifier that uses a convex combination of graphs from f. 

^Recently, it has come to our attention that, independent of and concurrent to our work, Charikar, Leighton, Li, and Moitra, and 
independently Makarychev and Makarychev, obtained results similar to the first two below, as well as related lower bounds. 
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embedding of 0GNRSO4| ], this gives an alternate proof of the fact that the metric induced on the vertices of 
a single face of a planar graph can be embedded into a distribution over trees [ LS09| ]. 



4. The results for planar graphs are in fact much more general. Suppose G is a j^G-decomposable graph (see 
definition in Section 1.1). Then we can efficiently output a distribution over graphs Hf = {K,Ef) such that 
these are all minors of G, and the expected stretch max„ ^UiHj(f(ii)J(v))ydQ{u,v) is bounded by Oific log fie)- 
Now applying the same ideas of interchanging distance and capacity preservation, given any G and K, we 
can find minor-based flow sparsifiers with loss Oific log/So)- 

5. Finally, we show some lower bounds on flow-sparsifiers: we show that flow-sparsifiers that are 0-extensions 
of the original graph must have loss at least Q( -y/logX) in the worst-case. For this class of possible flow 
sparsifiers, this improves on the n(log log k) lower bound for sparsifiers proved in [ |LM1(]| ]. We also show 
that any flow-sparsifier that only uses edge capacities which are bounded from below by a constant, must 
suffer a loss of Q( ^J\ogk| log log k) in the worst-case. 

We can use these results to improve the approximation ratios of several application problems. In many cases, 
constructions based on trees allow us to use better algorithms. Our results ai^e summarized in fTable 1 . Note that 
apart from the two linear-arrangement problems, our results smoothly approach the best known results for the 
case k = n. 





Previous Best Result 


Our Result 


Best Result when k - n 


Flow Sparsifiers (efficient) 




0{ ) 

^"'loglogi-' 




Tree-Based Flow 
Sparsifiers 




0(\ogk) 


0(log«) 


Minor-based Flow 
Sparsifiers 




0(J3c\og/3c) 




Steiner Oblivious Routing 


d{\og-k) 


OQogk) 


0(log«) 


^-Multicut 


0(\oik) 


Odogk) 


O(logn) 


Steiner Minimum Linear 
Arrangement (SMLA) 


diW'k) 


0(\og k log log k) 


0( ^j\og n log log «) 


SMLA in planar graphs 


OiXoi'k) 


O(loglog^) 


O(loglogn) 


Steiner Min-Cut Linear 
Arrangement 


OiXog'k) 


0(\oik) 


(9(logi-5„) 


Steiner Graph Bisection 


0(log«)'-,0(i|g,) 


OiXogk) 


0(log«) 



Table 1 : Summary of our results. Previous results marked with t from ||Rac08|, all others from [ |Moi09t [lmIoI i 



Many of these applications further improve when the graph comes from a minor-closed family (and hence has 
good yS-decompositions): e.g., for the Steiner Minimum Linear Arrangement problem on planar graphs, we can 
get an C?(log log ^)-approximation by using our minor-based flow-sparsifiers to reduce the problem to planar 



instances on the k terminals. Finally, in Appendix C we show how to get better approximations for the Steiner 
linear arrangement problems above using direct LP/SDP approaches. 



1.1 Notation 



Our graphs will have edge lengths or capacities; all edge-lengths will be denoted by ^ : £ — > R>o, and edge 
costs/capacities will be denoted by c : £ ^ R>o. When we refer to a graph (G,€), we mean a graph G with 
edge-lengths ^(0; similai^ly (//, c) denotes one with capacities c(-). When there is potential for confusion, we will 
add subscripts (e.g., c//(-) or ^g(O) for disambiguation. Given a graph (G,€), the shortest-path distances under 
the edge lengths t is denoted by • ^ x ^ ^ R.>o- 
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Given a graph G - (V, E) and a subset of vertices K QV designated as terminals, a retraction is a map / : V — > A' 
such that /(a;) = x for all a; € ^. For (G, c) and terminals ^ c V, a K-flow in G is a multicommodity flow whose 
sources and sinks lie in K. 

Decomposition of Metrics. Let {X,d) be a metric space with terminals K (Z X. K partition (i.e., a set of disjoint 
"clusters") P of X is called ^-bounded if every cluster S e P satisfies maxu^y^s^iu, v) < A. The metric (X, d) with 
terminals K is called fi-decomposable if for every A > there is polynomial time algorithm to sample from a 
probability distribution p over partitions of X, with the following properties: 



• Diameter bound: Every partition P e supp(//) is A-bounded. 

• Separation event: For all u,v e X, Prpg^[35' € P such that u e S but v ^ S] < p ■ d(u, v)/A. 



yS-decompositions of metrics have become standard tools with many apphcations; for more information see, e.g., 

[ pro5| ]. 

We say that a graph G = (V,E)is /3-decomposable if for every nonnegative edge-lengths the resulting shortest- 
path metric dg is yS-decomposable. Additionally, we assume that each cluster S in any partition P induces a 
connected subgraph of G; if not, break such a cluster into its connected components. The diameter bound and 
sepai^ation probabilities for edges remain unchanged by this operation; the separation probability for non-adjacent 
pairs (u, v) can be bounded by p ■ d{u, v)/A by noting that some edge on the u-v shortest path must be separated 
for (m, v) to be separated, and applying the union bound. 

2 0-Extensions 



In this section we provide a definition of 0-extension which is somewhat different than the standard definition. 



and review some known results for 0-extensions. We also derive in Corollary 2.4 a variation of a known result on 



tree embeddings, which will be applied in [Section 3 

A 0-extension of graph (G - {V, E), (q) with terminals K QV i?, usually defined as a retraction f : V ^ K. We 
define a 0-extension to be a retraction f : V ^ K along with another graph {H = (K, Eh), {h)\ here, the length 
function tu : En — > R+ is defined as €E{x,y) - dG{x,y) for every edge {x,y) € Eh- Note that this immediately 
implies dH{x,y) > dG{x,y) for all x,y e K. Note also that Hf defined in [Section 1 is a special case of H in which 
Eh = {(/(m),/(v)) : (u,v) € E], whereas, in general, H is allowed more flexibility (e.g., H can be a tree). This 
flexibility is precisely the reason we are interested both in the retraction / and in the graph H — we will often want 
H to be structurally simpler than G (just like we want a flow-sparsifier to be simpler than the original graph). 

For a (randomized) algorithm ^ that takes as input (G, £g) and outputs a (random) 0-extension {H, £h), the stretch 
factor of algorithm is the minimum a > I such that 



EH[dH(fix),f(ym<adGix,y) 
The following are well-known results for 0-extension. 



for all x,y e V. 



Theorem 2.1 ([FHRT03[) There is an algorithm JIfhrt for 0-extension with stretch a = ofhrt '■= 0{ 



log log/: 



)■ 



Theorem 2.2 ([CKR04|, see also [LN05]) If the graph is ^-decomposable, there is an algorithm ^cKRfor 0- 



extensions with stretch a = ockr '■= 0(fi). 



In particular, if the graph G belongs to a non-trivial family of graphs that is minor-closed, it follows from [ KPR93| , 
FT03I1 that a - 0(1). 
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2.1 0-Extension With Trees 



The following result is an extension of the tree-embedding theorem of Fakcharoenphol et al. [ ]FRT04| ], where the 
difference is that the following result ensures the non-contracting property (a) only for terminal-terminal pairs, 
but replaces the 0(log n) by 0(log k) in the expected stretch between any pair of nodes. 



Theorem 2.3 ([ GrNRlOp ) There is a randomized polynomial-time algorithm that takes as input a graph G 



{V, E) with terminals K QV and outputs a (random) edge-weighted 2-HST T = (I U L, Ej) with internal nodes I 
and leaves L, and a map f : V ^ L such that 

(a) dT{f{x),f(y)) > dQ{x,y) for all x,y e K (with probability I), 

(b) ET[dTW{x),f(y))] < Oilogk) dG{x,y) for all x,y € V, and 

(c) for each non-terminal v e V \ K, either there exists a terminal x^ sharing the leaf node with it (i.e., 
/(v) = f(Xv)), or another descendent of f{v)'s parent in T contains a terminal Xy. 

Corollary 2.4 (Tree 0-extension) There is a randomized polynomial-time algorithm ^icNRfor 0-extension that 
has acNR - Oilogk); furthermore, the graphs output by the algorithm, are trees on the vertex set K. 

Proof: To prove the corollary, we need to show an algorithm that takes as input a graph G - {V,E) with terminals 
K QV and outputs a (random) edge-weighted tree T - (K, E) and a retraction f : V ^ K such that 

(a') dT{x,y) > dG(x,y) for all x,y € K (with probability 1), 
(b') ET[dT{f(x),f(y))] < Oilogk) dGix,y) for all x,yeV. 



We start with sampling a random tree T' = (/ U L, £") and associated map / from the distribution of [Theorem 2.3 
We can take any leaf I e L whose pre-image set only contains non-terminals, remove the leaf, and remap all 
V e f^^iT) to some other leaf that is a descendent of /'s parent node and also contains a terminal. (Such a leaf is 



guaranteed to exist by property (cj of Theorem 2.3 .) While both the tree and the map change, we continue to call 
the modified tree T' and the map /. We repeat this process until all leaves in the modified tree T' contain at least 
one terminal. We can also assume that all terminals are at non-zero distance from each other (else we can remove 
some terminals, do the same proof, and add back in the terminals at the end) — now property (a) implies each leaf 
contains at most one terminal. Hence /|^ is a 1-1 correspondence between the terminal set K and the remaining 
leaves in the tree T' . Since the tree T' is a 2-HST, the distances in the tree between a remapped non-terminal and 
any other node in T' (apart from the one it was identified with) do not change. 



We can now remove all internal nodes in the modified version of T' (using, say, pGupOl ]) to get a tree T 



iL,E") on just the (erstwhile) leaves such that none of the /(m)-/(v) distances are shrunk, and they are stretched 
by a factor of at most 8. The bijection between the set L and terminals K allows us to view the tree T" as being 
on the node set K, and the map / as being a retraction from V ^ K. Finally, shrinking the edges of the tree T" 
only makes the expected stretch smaller, so we can reduce the length of any tree edge e = ix, y) in T" and set 
it equal to dGix,y). Call this final tree T; it is immediate from properties (a) and (b) that this random T and the 
associated retraction f : V ^ K satisfy properties (a') and (b') above, where the big-Oh term in property (b') 
hides an extra stretch of 8 due to this post-processing. ■ 



As an aside, a weaker version of [Corollary 2.4| with Oi-^^) can be proved as follows. First use fTheorem 2.1 



to obtain a random 0-extension H from G such that E//[t///(;c,y)] < Oi-^^^^^) dGix,y) for all x,y e K. Then 



use the result of [ ]FRT04[ ] to get a random tree H' = iK,EH') such that E//'[(i//'(x, j)] < Oilogk) dHix,y) for all 



x,y € ViH). Combining these two results proves the weaker claim. 

3 Flow-Sparsifiers 

Recall that given an edge-capacitated graph (G, c) and a set c y of terminals, a. flow-sparsifier with quality p 
is another capacitated graph (// - iK,EH),CH) such that (a) any feasible K-flow in G can be feasibly routed in 
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H, and (b) any feasible iT-flow in H can be routed in G with congestion p. 



3.1 Interchanging distance and capacity 



We now use the framework of Racke | |Rac08| ], as interpreted by Andersen and Feige [ |AF09| ]. Given a graph 
G - (V, E), let !P be a collection of multisets of E, which will henceforth be called paths. A mapping M : E ^ P 
maps each edge e to a path M{e) in P. Such a map can be represented as a matrix M in z'^'^'^' where is the 
number of times the edge e' appears in the path (multiset) M{e). Given a collection At of mappings (which we 
call the admissible mappings), a probabilistic mapping is a probability distribution over (or, convex combination 
of) admissible mappings; i.e., define Am > for each M e M such that Y^MeM - 1- 

Distance Mappings. Given G = {V,E) and lengths £ : E ^ ]R>o, 

• The stretch of an edge e e E under a mapping M is Yje' ^e,e'€{e')l£{e). 

• The average stretch of e under a probabilistic mapping {A] is YjM ^M(Ze' e/^(e')/^(^))- 

• The stretch of a probabilistic mapping is the maximum over all edges of their average stretch. 

Capacity Mappings. Given a graph G with edge capacities c : E ^ ]R>o, 

• The load of an edge e' € E under a mapping M is ^e,e'cie)/c{e'). 

• The expected load of e' under a probabilistic mapping [A] is YjM ^M(Ze g'c(e)/c(e'))- 

• The congestion of a probabilistic mapping is the maximum over all edges of their expected loads. 



The Transfer Theorem. Andersen and Feige [ |AF09| ] distilled ideas from Racke [ |Rac08| ] to state: 



Theorem 3.1 (Theorem 6 in [ |AF09 ]) Fix a graph G and a collection M of admissible mappings. For every 



p > 1, the following are equivalent: 

1. For every collection of edge lengths ig, there is a probabilistic mapping with stretch at most p. 

2. For every collection of edge capacities Ce, there is a probabilistic mapping with congestion at most p. 



In our settings, the techniques of Racke [Rac08 ] can be used to make the result algorithmic: if one can efficiently 
sample from the probabilistic mapping with stretch p (which is true for the settings in this paper), one can 
efficiently sample from a probabilistic mapping with congestion 0{p) (and vice versa). In fact, one can obtain an 
explicit distribution on polynomially many admissible mappings. We defer further discussion of efficiency issues 
to the full version of the paper. 

3.2 Tree-Based Flow Sparsifiers 

The distance mappings we will consider will be similar to Racke's application. Let us first fix for each u,v € K 
a canonical shortest-path S uv between u,v in G. Now, consider a tree 0-extension (r, /) where T = {K,Ej) and 
/ : y ^ TiT is a retraction. For each edge e = (w, x) e E{G), consider the (unique) /(w)-/(x)-path Prifiw), fix)) 
in the tree T. Define the mapping Mr : E ^ P coiTcsponding to the 0-extension (T, f) by 

Mt((w, x)) = ^(u,v)ePT(f(w),f{x))S uv (3.1) 

In other words, this maps each tree edge (w, x) to its canonical path; for each non-tree edge (w, x), it considers 
the edges on the tree-path between the images of w and x in the tree, and maps (w, x) to the disjoint union of the 
canonical paths of these edges. Recall that Mt{{w,x)) is a multiset. In the corresponding matrix representation, 
Me e' is the multiphcity of e' in the set ^{u,v)ePT(f(w)j{x))S uv [Corollary 2.4| now implies the following: 
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Theorem 3.2 Given a graph (G, £) with terminals K c V{G), there is a polynomial-time procedure to sample 
from a probabilistic mapping (which is a distribution over tree 0-extensions) with stretch piUst - C?(log^). More- 
over, pdist >\ifKi^%. 

Now we can apply the Transfer Theorem. Recall that in a ^T-flow, all source-sink pairs belong to set K. 

Theorem 3.3 (Tree-Based Flow-Sparsifiers) Given an edge-capacitated graph (G, c), and a set of terminals 
K QV, there is a polynomial-time algorithm that outputs a graph H - (K, Eh) that is a convex combination of 
edge capacitated trees such that: 

(a) every K-flow that can be routed in G, can also be routed in H; and 

(b) every K-flow that can be feasibly routed in H, can be routed with congestion C?(log^) in G. 



In other words, if we were to scale up the capacities in G to route all feasible flows in H, then the factor by which 
we would have to scale up capacities would only be C?(log k). 



Proof: We apply [Theorem 3.1| and [Theorem 3 .21 to G = (V, £) to get a convex combination [Ajj} of maps 
{T = (K, Ej), f) such that each edge in E has an average load of 0{pdist)- Let us see how this implies (a) and (b) 
above: this is essentially a matter of unraveling the definitions. For each such (T, /), we define capacities on the 
edges ej € Ej thus: let (A, B) be node sets of the two connected components of T formed by deleting the edge 
ej, where AVJ B = K.'Let A' = {v eV \ f{v) e A], and B' = V\ A'. Define 

CTjier)- 2 (^-^^ 

ee£n(A'xB') 

We claim that this convex combination {Atj] of capacitated trees satisfies (a) and (b). For (a), the definition of the 
capacities ctj ensures that each edge of G can be concurrently routed feasibly in each T using capacities ctj{-), 
hence so can any ^-flow feasible in G. Since this holds for each {T,f) pair, it holds for the convex combination. 

To prove (b), we want to route edges in the convex combination of trees in the graph G, where we scale the 
capacities ctj of edges from {T,f) by its convex multiplier Ajj. Consider any edge er = (m,v) € Ej with 



capacity crjier) defined in ([3.2|): we can use the canonical shortest path Suv to route this flow. Hence the load 



on any edge e' = (w', x') € E due to the convex combination of trees is at most 



c(e') 4ri 



Atj ^ CTjier). (3.3) 

T,f ereET-.e'eSuv 



Since crjier) is the sum of the capacity of all edges e = (w, x) such that er lies on the unique tree-path between 



f(w),f{x), we rewrite (3.3) as 



^ T,f eT=(u.v)eET:e'eS„y(w,x)eE:eTePT(f(w),f(x)) 

^--j-^-^r,/ ^ c(w;c) X (multiplicity of e' in i±J(„,v)gp^(/(v,)j(;,)) 5 „v)- (3.5) 



T,f iw,.x)eE 



However, this is exactly the expected load for e' under the notion of admissible maps defined in (3.1); hence this 



is bounded by the congestion (the maximum expected load over all edges), which is at most p^jst by [Theorem 3. 1 



This proves condition (b) above, that the congestion to route any ^-flow in the convex combination H in the 
graph G is at most pdist- ■ 
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3.3 General Flow Sparsiflers 



Theorem 3.4 (Flow-Sparsifiers) Given any graph G and terminals K, there is a randomized polynomial-time 



algorithm to output a flow -spar sifter H with loss 0{ 



log log k 



Proof: Suppose we use [Theorem 2.1| instead of using the tree 0-extension resuh ( [Corollary 2.4[ ), we use the 
constructive version of the Transfer Theorem to get a polynomial number of graphs H\,H2, ... on the vertex set 
K such that a convex combination of these graphs is a flow-sparsifier for the original graph G where the load 
is O(j^j^p). We can then construct a single graph H by setting the capacity of an edge to be the appropriate 
weighted combination of capacities of those edges in Hf, all feasible isT-flows in G can be routed in H, and all 



feasible A'-flows in H can be routed in G with congestion 0{ 



log log k 



)■ 



The same idea using 0-extension results for yS-decomposable graphs ( [Theorem 2.2| ) gives us the following: 



Theorem 3.5 (Flow-Sparsifiers for Minor-Closed Families) For any graph G that is ^-decomposable and any 
K, there is a randomized polynomial-time algorithm to construct a flow-sparsifter with loss 0(J3). 

Note that the decomposability holds if G belongs to a non-trivial minor-closed-family Q (e.g., if G is planar). 



However, [Theorem 3.5[ does not claim that the flow-sparsifier for G also belongs to the family Q; this is the 
question we resolve in the next section. 

4 Connected 0-Extensions and Minor-Based Flow-Sparsifiers 

The results in this section apply to yS-decomposable graphs. A prominent example of such graphs are planar 
graphs, which (along with every family of graphs excluding a fixed minor) ai^e 0(l)-decomposable [ KPR93| , 
FT03 ]. Thus, Theorem 4~I| , [Corollary 3.3 and [Theorem 4.3 below all apply to planar graphs (and more generally 
to excluded-minor graphs) with y6 = 0(1). We now state our results for /3-decomposable graphs in general. 



In [Section 4.2[ we define a related notion called terminal-decomposability, and show analogous results for ji- 
terminal-decomposable graphs. 

In what follows we use the definition of 0-extension from [Section 2 with H = Hf, i.e., Ee = {(/(«), /(v)) : (u, v) € 
E], hence the 0-extension is completely defined by the retraction /. We say that a 0-extension / is connected if for 
every x, f~\x) induces a connected component in G. Our main result shows that we get connected 0-extensions 
with stretch 0(filog/i) for y6-decomposable metrics. 

Theorem 4.1 (Connected 0-Extension) There is a randomized polynomial-time algorithm that, given {G - 
(V, E), io) with terminals K such that dg is /3-decomposable, produces a connected 0-extension f : V ^ K 
such that for all u,v e V, we have 

E[dH(m,f(v))] < OifiXogp) ■ dG(u,v). 



Note that if / is a connected 0-extension, the graph Hf is a minor of G. Applying [Theorem 3.1 to interchange 



the distance preservation with capacity preservation, we get the following analogue of Theorem 3.3 



Corollary 4.2 (Minor-Based Flow-Sparsifiers) For every J3-decomposable graph G = (V, E) with edge capac- 
ities Co and a subset K c V of k terminals, there is a minor-based flow-sparsifler with quality 0(J3 log yS) . 
Moreover, a minor-based flow-sparsifter for G, cq, K can be computed efficiently in randomized poly-time. 



Since planar graphs are 0(l)-decomposable and since their minors are planar, by [Corollary 4.2[ they have an 
efficiently constructable planar-based flow-sparsifier with quality 0(1). By [Theorem 4.1 , they always have a 
connected 0-extension with stretch at most 0(1). An interesting consequence of the latter result is that given any 
planar graph (G, £g)^ and a set K of terminals, we can "remove" the non-terminals and get a related planar graph 
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on K while preserving inter-terminal distances in expectation. This generalizes a result of Gupta JGupOlp who 
showed a similar result for trees. (Obviously, this extends to every family of graphs excluding a fixed minor.) 



Theorem 4.3 (Steiner Points Removal) There is a randomized polynomial-time algorithm that, given (G - 

da{x,y) — 



{V,E),{g) '^nd K such that do is ^-decomposable, outputs minors H = {K,Eh) of G such that 1 < '^[^"(;^'>M < 



0(J3 log /3) for all x,y € K. 

Note that these results only give us an C?(log n log log ?i)-approximation for connected 0-extension on arbitrary 
graphs (or an C>(log^ ^ log log ^)-approximation using results of Section 4.2). We can improve that to O(log^); 



the details are in Section 4.3 



Theorem 4.4 (Connected CKR) There is a randomized polynomial-time algorithm that on input {G = {V, E), Cq) 
and K, produces a connected 0-extension f with stretch factor E[J//(/(m), /(v))] < C?(log k) • ddu, v) for all 
u,v eV. 

Using the semi-metric relaxation for 0-extension, we get a connected 0-extension whose cost is at most C?(log k) 
times the optimal (possibly disconnected) 0-extension. To our knowledge, this is the first approximation algo- 
rithm for connected 0-extension, and in fact shows that the gap between the optimum connected 0-extension and 
the optimum 0-extension is bounded by 0{logk). The same is true with an 0(1) bound for planar graphs. We 
remark that the connected 0-extension problem is a special case of the connected metric labeling problem, which 
has recently received attention in the vision community JVKROS , NL09 ]. 



4.1 The Algorithm for Decomposable Metrics 



We now give the algorithm behind [Theorem 4. 1 . Assume that edge lengths £g are integral and scaled such that 



the shortest edge is of length 1. Let the diameter of the metric be at most 2^. For each vertex v € V, define 
Ay = minveA- ddv, x) to be the distance to the closest terminal. The algorithm maintains a partial mapping / 
at each point in time — some of the /(v)'s may be undefined (denoted by /(v) = J.) during the run, but / is a 
well-defined 0-extension when the algorithm terminates. We say a vertex v e V is mapped if /(v) ±. The 
algorithm appeai^s as Algorithm 1. 



Algorithm 1 Algorithm for Connected 0-extension 

1: input: (GJgIK. 

2: let / ^ 0, fix) = X for all x e K, f{v) - ± for all v e V \ K. 

3: while there is a v such that /(v) = J. do 
4: let / <- / 1, r; «- 2' 

5: sample a /3-decomposition of do with diameter bound r, to get a partition P 

6: for all clusters C.v in the partition P that contains both mapped and unmapped vertices do 

7: delete all vertices u in with f{u) + ± 

8: for each connected component C from C.? do 

9: choose a vertex wq e that was deleted and had an edge to C 

10: reset f(u) = f{wc) for all ueC. 

11: end for 

12: end for 

13: end while 



We can assume that in round 6 = log diam(G), the partitioning algorithm returns a single cluster, in which case 
all vertices are mapped and the algorithm terminates. Let f be the mapping at the end of iteration /. For x e K, 
let Vf denote /]r'(x), the set of nodes colored x. The following claim follows inductively: 

Lemma 4.5 For every iteration i and x e K, the set induces a connected component in G. 
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Proof: We prove the claim inductively. For / = 0, there is nothing to prove since Vf - {x}. Suppose that in 
iteration /, we map vertex ii to x so that ii e Vf. Thus for some component C containing u, the mapped neighbor 
wc chosen by the algorithm was in Vf_y. Since we map all of C to x, there is a path connecting v to wc in Vf. 
Inductively, wc is connected to x in Vf_y c Vf, and the claim follows. ■ 

The following lemma will be useful in the analysis of the stretch; it says that any node mapped in iteration / is 
mapped to a terminal at distance 0(2'). 

Lemma 4.6 For every iteration i and x e K, and every u e V^, ddx, u) < Iri. 

Proof: The proof is inductive. For / = 0, the claim is immediate. Suppose that in iteration /, we map vertex u to 
X so that u € V^. Thus for some component C containing u, the mapped neighbor wc chosen by the algorithm 
was inV^ y Moreover, u and wc were in the same cluster in the decomposition so that d{u, wc) < Inductively, 
d{wc, x) < 2r;_i and the claim follows by triangle inequality. ■ 

In the rest of the section, we bound the stretch of the 0-extension; for every edge e = (u, v) of G, we show that 

E[Jg(/(«),/(v)] < 0(J3log/3) dG{u,v). 

Note that for e = {u,v), dG{{f{u),f{v)) = dH{{f{u),f{v)), and so it's enough to prove the claim for dQ. The 
analogous claim for non-adjacent pairs will follow by triangle inequality, but here with du- We say that the edge 
e - (u, v) is settled in round j if the later of its endpoints gets mapped in this round; e is untouched after round 
j if both u and v are unmapped at the end of round / Let dG{u,K) < dG{v,K) and let denote the distance 
dG(u,K). Letje := Uog(Ae)J - 1. 

Lemma 4.7 For edge e = (u, v), 

(a) edge e is untouched after round je - 1, 

(b) if edge e is settled in round j then dcifiu), f(v)) = 0{2J + dciu, v)). 

Proof: For (a), if one of the end points of e is mapped before round jV> then 2 • 2^^ < Ae = dde, K), which 
contradicts Lemma 4.6 . For (b), both dciu, f{u)),dG(v, f{v)) < 2^^' by [Lemma 4.6 ; the triangle inequality 



completes the proof. ■ 

Let Sj denote the "bad" event that the edge is settled in round j and that both end-points are mapped to different 
terminals. Let z '■= maxjA^, dciu, v)}. We want to use 



E[d{f{u),f{v))] - P^'t^y] • Ey(/(M),/(v)) I Sjl 



Claim 4.8 Pr[Sj] < mm{4/3jj, 1} • 5(5^^. 

Proof: Recall that an edge is untouched after round / if neither of its endpoints is mapped at the end of this round. 
For this to happen, u must be separated from its closest terminal in the clustering in round /, which happens with 
probability at most min{jS^, 1). Also recall that the probabihty that an edge e = (w, v) is cut in a round / is at 

most ji^^^r^. Let / denote the round in which the edge is first touched. We upper bound the probability of the 
event Sj separately depending on how / and j compare. Note that for j < 2, the right hand side is at least 1 so the 
claim holds trivially. 



i < j - 2. For Sj to occur, the edge e must be cut in round j - 2 and j - 1, as otherwise it would 

2J'2 '^1 ' P 21' 



already be settled in one of these rounds. The probabihty of this is at most min{yS^^j^, 1) • ^ ^^(m.v) ^ 



min{4;S^,l).2/3^. 
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i = j - \. For Sj to occur, the edge e must be cut in round j - \ and must be untouched after round j - 2. 
The probabihty of this is at most min{/3^, 1) • y6^|^ < min{4yS|j, 1} • 2/3^^^. 



i - j. For Sj to occur, e must be cut in round j and must be untouched after round j - I. The probability 
of this is at most min{y6^, 1) ■ < min{4/3|j, 1) ■ /3^^ 



Since Pr[Sy] - Pr[Sj A (/ < j- 2)] + Pr[Sj A (/ - ; - 1)] + Pr[Sj A (/ = ;)], the claim follows. ■ 
Lemma 4T7| (b) implies that if the edge is settled before round jd '■= [log{dGiu,v))i, the conditional expectation 



E[<iG(/(M),/(v)) I Sj] is 0(dG(u,v)). Moreover the edge e cannot be settled before round je = Llog(^e)J - 1 by 
Lemma 4^ a). Let ],„ := maxij^, je)- It therefore suffices to to show that 



Pr[Sj] ■ 0(2^) < 0(J3\og/3) Jg(",v) 



Plugging in the upper bound for Pr[Sy] into the left hand side, we get 

Zj>j^ mSj] ■ 0{2i) < min{4;S^, 1) • 5;S^ • 0{V) 

< Zj>j,„ mm{4^jj, 1! •/? • 0{dG{u,v)) < OifilogP) dG{u,v) . 

In the last step, we used that z - max{A^, (^^(i^, v)) < max{2^'''''^, 2^''"'^') < 2^'""^^, so the first O(logyS) terms con- 
tribute 0(J3dGiu,v)), while the remaining terms form a geometric series and sum to 0{dGiu,v)). This completes 



the proof of Theorem 4Tl|. 



4.2 Terminal Decompositions 

The general theorem for connected 0-extensions gives a guarantee in terms of its decomposition parameter and 
in general this quantity may depend on n. This seems wasteful, since we decompose the entiix metric while we 
mostly care about separating the terminals. 

To this end, we define terminal decompositions (the reader might find it useful to contrast it with definition of 



decompositions in Section 1.1). A partial partition of a set X is a collection of disjoint subsets (called "clusters" 
of X). A metric (X, d) with terminals K is called p-terminal-decomposable if for every A > there is probability 
distribution over partial partitions of X, with the following properties: 

• Diameter bound: Every partial partition P e supp(ju) is connected and A-bounded. 

• Separation event: For all u,v e X, Frp^^[3S € P such that u e S but v ^ S] < p ■ d{u, v)/A. 

• Terminal partition: For all x e K, every partial partition P e supp(/i) has a cluster containing x. 

• Terminal-centered clusters: For every partial partition P € supp(//), every cluster S e P contains a terminal. 

A graph G = (V, E) with terminals K is yS-terminal-decomposable if for every nonnegative lengths Ig assigned 
to its edges, the resulting shortest-path metric dG with terminals K is y6-terminal-decomposable. Throughout, 
we assume that there is a polynomial time algorithm that, given the metric, terminals and A as input, samples a 
partial partition P e fi. Note that if ^ = V, the above definitions coincide with the definitions of /J-decomposable 
metrics and graphs. 

Our main theorem for terminal decomposable metrics is the following: 

Theorem 4.9 Given (G = (V, E), €g)> suppose dc is ^-terminal-decomposable with respect to terminals K. There 
is a randomized polynomial-time algorithm that produces a connected 0-extension f : V ^ K such that for all 
u,veV, we have E[Jg(/("), /(v))] < 00^ log P) ■ Jg(", v). 

This theorem is interesting when jS is much less than the decomposability of the metric itself. E.g., one can 
alter the CKR decomposition scheme to get p{k, n) - C?(log k), while /? - C?(log n). 
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4.2.1 The Modified Algorithm. 



Algorithm 2 for the terminal-decomposable case is very similar to Algorithm 1: the main difference is that in 
each iteration we only obtain a partial partition of the vertices, we color only the nodes that lie in clusters of this 
partial partition. 

A few words about the algorithm: recall that a partial partition returns a set of connected diameter-bounded 
clusters such that each cluster contains at least one terminal, and each terminal is in exactly one cluster — we use 
to denote the cluster containing x e K. (Hence either = or nV^ = 0.) Now when we delete all the 
vertices in some cluster that are already mapped, this includes the terminal x — and hence there is at least one 
candidate for wc in Line 9| . Eventually, there will be only one cluster, in which case all vertices are mapped and 
the algorithm terminates. 



Algorithm 2 Algorithm for Connected 0-extension: the terminal-decomposable case 

1: input: (GJgIK. 

2: let / ^ 0, fix) = X for all x € K, /(v) - ± for all v € V \ K. 

3: while there is a v such that /(v) = ± do 
4: let / <- / 1, r; «- 2' 

5: find a yS-terminal-decomposition of do with diameter bound r, ; let be the cluster containing terminal x. 

6: for all clusters V'^ in the partial partition do 

7: delete all vertices u in with f{u) + ± 

8: for each connected component C from thus formed do 

9: choose a vertex wq e that was deleted and had a neighbor in C 

10: reset f{u) = f{wc) for all u e C. 

11: end for 
12: end for 

13: end while 



The analysis for Theorem 4.S is almost the same as for [Theorem 4. 1 ; the only difference is that Claim 4.8 is 
replaced by the following weaker claim which immediately gives the 00^ log yS) bound. 

Claim 4.10 Pr[Sj] < min{S^jj, 1) • 23^^^. 

Proof: Recall that an edge is untouched after round / if neither of its endpoints is mapped at the end of this round. 
For this to happen, u must be separated from it's closest terminal in the clustering in round /, which happens 
with probabihty at most min{jS^, 1). Also recall that the probability that an edge e = {u, v) is cut in a round / is 

at most Let / denote the round in which the edge is first touched. We upper bound the probability of the 

event Sj separately depending on how / and j compare. Note that for j < 3, the right hand side is at least 1 so the 
claim holds trivially. 



• i < j - 3. For Sj to occur, it must happen that the edge is cut in round / and it is either untouched or cut in 
rounds 7-1 and j - 2. The probability for this to happen is at most min{yS^^^^, 1) • min{y6(^^ -1- ^j^), 1) • 
yS(^ + < min{%^, l]mm{S0jj, 1} • 40^ jj. If d{u,v) > this is at most mm{S0jj, 1} • 160^^ 
as z = d{u, v). Otherwise, observe that / > je as the edge cannot be touched before. Hence 2' > Ae/4, and 
plugging this in gives a bound of min|8yS^, 1} • 16^^^^^, as well. 

• i - j - 2. For Sj to occur, the edge e must be cut in round j - 2 and it must be cut or untouched in round 
7 - 1, as otherwise it would already be settled in one of these rounds. The probabihty of this is at most 
ySf^l • mmCpi^ + ^), 1) < min{4;6X, 1} . 4^'-^. 
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• i = j - \. For Sj to occur, the edge e must be cut in round 7 - 1 and must be untouched in round j - 2. The 
probability of this is at most min{yS^, 1) -yS^ < mm{4^jj, 1} • 2^^. 

• i = j. For Sj to occur, e must be cut in round j and must be untouched in round j - I. The probability of 
this is at most min{;S^, 1) -yS^ < min{4$jj, 1} -0^. 

Since Pr[S;] - Pr[S,- A (/ < j- 3)] + Pr[S; A (/ = j- 2)] + Pr[S; A (/ = j - 1)] + Pr[S,- A (/ = 7)], the claim 
follows. ■ 

4.3 Connected 0-extension on General Graphs 

Finally, we show that for general metrics, we can do better than the C?(log^ k log log k) guarantee implied by 



Theorem 4.9| . In particulars we now prove Theorem 4.4| , which gives a C?(log k) guarantee. We still use Algo- 



rithm 1 from the previous section, but use a specific decomposition algorithm. The following result follows from 



Fakcharoenpol et al. [FHRT03], who built up on the work of Calinescu, Karloff and Rabani [CKR04]: 



Theorem 4.11 ([ ]FHRT03 ]) Let (G = {V, E), {q) with a terminal set K - {xi, . . . , Xk\ £ V. There is a (random- 



ized) polynomial-time algorithm that produces, for each i - . . . , [log diam(G)l, a collection ofk + 1 clusters 
[C[y Cj , . . . , C[), such that 

(a) (Diameter) For any j + 0, O. contains the terminal xj, and d{xj, v) < 2' for any v e C'j, 

(b) (Separation) For any u,v e X, Pr[37 such that u € C'j but v ^ C'j] < C?(/3"'') • d{u, v)/2', where the proba- 
bility is taken over the internal coin tosses of the algorithm, and 

(c) (Amortization) For any u,v e X, Y^iPf < P - 0(log k). 

(d) (Coverage) U j^^qC'j contains U^_jBj(xj, 2'"^). 

We remark that we do not need each cluster to induce a connected component. Observe that the (Diameter) and 
(Coverage) properties imply: 

(e) (Laminarity) For any i, Uj^qCj^^ 2 U^^oC^ with probability 1. Hence also C'^ 2 C'^^^ with probability 1. 

We run Algorithm 1 with this decomposition; the only worry is that since the clusters are not connected, it may 
be the case that in step ^, we may not find a node wc as desired. In this case, we expel C from C.v, and do not map 
the vertices in C in this iteration. This ensures the connectivity property of /].~'(x)'s. Moreover, the Laminarity 
property inductively ensures that we never map any vertex from CJ, by the end of round /. Since the diameter 
property bounds the diameter of every other cluster. Lemma 4^ continues to hold. 



Now, by its very definition, any expulsion operation only removes components that are disconnected from the 
rest of Cs, and hence does not increase the separation probability for any edge. Moreover, it is still the case the 
if u is mapped before round j and an edge (u, v) is not cut in round j, then the node v gets mapped in round j as 
well. Indeed by laminarity, u is in one of the clusters containing a terminal, and if (u, v) is not cut, then so is v. 



Since u is mapped, the component containing v cannot be expelled. Thus [Claim 4.8| continues to hold and bounds 
the probability of Sj, implying that 

E[difiu),fiv))] = I^jPrlSj] ■ E[dif{u),f{v)) \ Sj] 

< Oidciu, V)) + Zj>j' mSj] ■ 0{V) 

< Oidciu, V)) + Y.j>j' mm{4pjj, 1} • 5/3f^^ ■ OiV) < Oifiddu, v)). 

Since /? = C?(log k), this gives us connected 0-extensions where the stretch is 0(log k), and hence finishes the 
proof of pTheorem 4.4 . 
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5 Future Directions 



We gave a set of results on and around the idea of flow-sparsifiers and 0-extensions. Some of these results aie 
not tight, and it would be interesting to obtain better bounds for these problems. Another interesting direction 
for future work is this: define an l-sparse-extension of graph G - {V,E) with terminals K to be any graph 
H = {Z,Eh) with \Z\ = £, K c Z c V, along with a retraction f : V ^ Z that satisfies dHix,y) > dG{x,y) for 
all x,y ^Z. (Note that a |^|-sparse-extension is just a 0-extension; one possible |V|-spai-se-extension is G itself.) 
What if we consider ^-sparse-extensions {H, f) with 

E[ dnifix), f{y)) ] < a d^x, y) for all x,yeV, 

where ideally ( = poly(fc), and a = 0(1) (or just a «; j^j^j^)? In other words, if we are willing to retain a small 
number of non-terminals, can we achieve better stretch bounds? Note that standard lower bounds for 0-extension 
have the property that \V\ = poly(/:) — whence the entire graph G is a "good" solution (poly(^)-sparse-extension 
with a - 1). 
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A Lower Bounds 

In this section, we show two kinds of lower bounds. The first shows that any flow-sparsifier that is a convex com- 
bination of 0-extensions must sulfer a loss of Q( ^/logk) — for such extension, this improves on the Q.{log log n) 
lower bound for (arbitrary) flow-sparsifiers [ ]LM10| ]. The second shows that any flow-sparsifier that only uses 
edge capacities which are bounded from below by a constant, must suffer a loss of Q( yjlogk/ log log k). 

A.l Lower Bounds for 0-extension-based Sparsifiers 

The following result can be viewed as following from the duality between 0-extensions and 0-extension-based 



flow-sparsifiers (Theorem 3.1); by that theorem, not only do good 0-extension algorithms give good 0-extension- 
based flow-sparsifiers, the converse would also be true — and hence one can use a lower bound of Calinescu 
et al. [ |CKR04 ] to infer lower bounds on 0-extension-based flow-sparsifiers. The foUowing theorem gives the 
explicit construction obtained thus. 

Theorem A.l For infinitely many values ofk, there is a graph G = {V, E) and a set K a V of size kfor which 
any flow-sparsifier that is a convex combination of 0-extension graphs has quality at least Q.{ yjlog k). 



Proof: We use the lower bound of Q.{^J\ogk) on the 0-extension integrality ratio by Calinescu et al. [ ]CKR04 ]. 
For completeness we describe their construction: Let G be an expander with n vertices, maximum degree A and 
expansion at least a, where A and a are fixed parameters. Define / - |" -y^log «j and k - Choose any k 

distinct vertices h\, hk e V {G) and add k new paths of length / starting at these vertices and ending at new 

vertices labeled l,...,k. Denote the resulting graph by G' (note that \ V {G')\ = 0{n) and \E{G')\ = 0(n)), and 
let the terminals K be the new vertices {1, . . . , ^). Set the capacities and lengths of the edges to 1. The distance 
distG' (s, t) between terminals s, t is set to be the shortest path distance in G' between s, t. 
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For the described instance G',K of the 0-extension problem, Calinescu et al. show a semimetric whose cost is 
\E (G')\, namely the shortest path metric dist^/ in G' with respect to edge lengths of 1. Indeed, 



I^e=iM,v)eE(G') ^^Pc ^isto' («, v) ^ (G')| = O (n) . 

On the other hand, they show that there exists a universal c > such that for any 0-extension function / : 
V{G')^K, 

I.e=(u,v)eE(G') capc (e) ' distc (/ {u) , f (v)) > cn yjlogn ^a[n yjlogk) . 

We use the instance G',K to construct a feasible solution to the dual LP for which the objective value y6 is at 
least Q ( -v/iogT). To construct the feasible solution, we need to specify the values of the dual variables £ (e) 

for e e E{G'), and dist(5',0 for s,t e K. We set {{e) to be j^^- Thus, T,eeE(G')CapG' {e){{e) = 1. We set 
dist (s, t) to be the shortest path distance between terminals s, t in G' with respect to edge lengths £ {e). Clearly 
the specified values of £{e) , dist {s, t) form a feasible solution to the dual LP for some value of (3. By the above 
proof of Calinescu et al., for any 0-extension function / : V {G') — > K, 

Y.e=(u,v)eE{G') caPc C^") dist (/ (m) , / (v)) > "\^^' = O. ( Vlog") = ^ ( yj^ogk) . 

Thus, we have show there exists a feasible solution to the dual LP such that /3 > Q{ -^log k). This implies that 
for any convex combination of 0-extensions H = ^ ^iHj, the minimum congestion of routing H in G' is at least 
Q.( ^Jlog k), completing the proof. ■ 

A.2 Lower Bounds for Sparsifiers having no Small Edges 

Theorem A.2 For infinitely many values ofk, there is a graph G - {V, E) and a terminal set K c V of size kfor 
which any flow-sparsifier with edge capacities at least s > has quality at least D,{£ -y/log k/ log log k). 

Proof: Let « be a sufficiently lai^ge prime. Let G = (V, E) be a graph whose nodes correspond to the elements of 
Z„ and that contains an edge {m, v) if v = m -i- 1, v = m - 1, or v = (all operations are w.r.t. Z„ and we define 
0~^ as 0.) In other words the graph consists of a Hamiltonian cycle plus some additional edges. This graph G is 



a 3-regular expander (see, e.g., [|IILW06|]). 

Choose the set of terminals K as {/ • [ ^j\ogn'\ | < / < ^ - 1), with k - n/f ^Jlogn]. To simplify notation, we will 
omit floor- and ceiling-operations in the following. For / e [0, ^ - 1], let B, be the set of the ^J\ogn nodes on the 
Hamiltonian cycle between terminal / and / -i- 1 , including / but excluding i + I. 

Let H = (K, Eh) be a flow sparsifier for G with edge capacities at least £ > 0. Let d be the maximum weight 
degree of H, where the weighted degree of a node is the sum over all capacities of incident edges. 

Claim A.3 The maximum weighted degree d of H is at least 

^]\ogn 

C • £ ' 

log log n 

for some constant c' . 

Proof: Consider a demand of 1 jk between all pairs of terminals. 

Since the minimum edge capacity is at least e, the unweighted degree of H is at most dje. Due to this bounded 
degree, for sufficiently large k, there are at least k^ jA terminal pairs that have distance at least \ogkj{l\og{die)) 



from each other (see e.g. [jCKROJ, Lemma 4.2]). 
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Each of these pairs induces a load of Ilk on at least \ogk I {2\og{d I s)) edges. Therefore, the total load in the 
network is at least ^log^/(81og((i/e)). Since H has at most k ■ d/ile) edges, the congestion in H is at least 
£logk/i4dlogid/E)). 

The same demand can be routed with congestion at most (c + 1 ) y]\ogn in G, for some constant c depending on the 
edge expansion of G. Say each terminal / sends a total flow of 1. We can distribute this flow evenly between the 
nodes in B, using only edges inside of B, and with congestion of at most 1. This can easily be done, since we can 
send this flow along the Hamiltonian cycle to reach every node in Bj. Now, we route a uniform multicommodity 
flow on the whole expander, where the flow leaving each node is 1/ ^^Jlogn, i.e., the demand between every pair of 
nodes is l/{n ^logn). This requires congestion at most clogn ■ (1/ -y/log n) - c ^J\ogn [ LR99| ]. Finally, the flow 



in each B, is routed inside B, to the respective terminal. Again, this can easily be done with congestion 1. In total, 
we sent a flow of Ijk between all pairs of terminals and the congestion is bounded by c yj\og n + 2 < (c+ 1) yjlogn. 

Hence, we identified a demand, that requires congestion at least e log k/{4d log((i/e)) in H but can be routed with 
congestion at most (c + 1) -y^log n in G. Since H is a flow sparsifier, its congestion has to be bounded by the 
congestion in G and thus, elog k I {Ad\og{d I e)) < (c + 1) y/logn. It follows that 

d, /d\ iogk 



-logp) 



4(c + 1) Vlogn 

Using the fact that k - n/ yjlog n, the claim follows. ■ 
Now pick a node in H that as weighted degree at least c' • e • yjlog n/ log log n (such a nodes exists due to 



Claim |A.3[ ). Consider the situation in which the demand between this node and every other node corresponds to 
the capacity of the edge connecting them in H, and all other demands are 0. Clearly, in H this can be routed with 
congestion 1. The terminal in G corresponding to node u, however, has only degree 3. Therefore, routing this 
demand in G results in congestion at least c' ■ s ■ yJlogn/{3 log log n) > c' ■ e ■ ^Jlogk/{3 log log k), since that is 
the load on at least one of the outgoing edges of u. ■ 

B Applications 



Most of these applications were considered by Moitra [ Moi09| ], and Leighton and Moitra [ LMIO] ]; we show how 



our results above give improved approximations to the problems. 
B.l Steiner Oblivious Routing 



Theorem 3.3| is an exact analogue of Racke's theorem on general flows [ Rac08 ] for the special case of /T-flows, 



and hence immediately gives an C?(log ^)-oblivious routing scheme for /^-flows. 
B.2 Steiner Minimum Linear Arrangement 

Given G = iV,E) and K c V with \K\ = k, the goal in the Steiner Minimum Linear AiTangement (SMLA) 
problem is to find a mapping F : V ^ [k] such that F\k : K ^ [k] is a. bijection. The goal is to minimize 



Tj(u,v)eE (^uv\Fiu) - F(v)|. Note that for the non-Steiner MLA case where K = V, Rao and Richa [ |RR98p gave an 
0(log ?i)-approximation for general graphs and an (9(log log ?i)-approximation for graphs that admit (9(l)-padded 
decompositions (which includes the family of all trees). 

For our algorithm, we take a random tree/retraction pair (T, f) from the distribution above; this ensures that the 
cost of the optimal map F* (viewed as a solution to the MLA problem on T) increases by an expected 0(log k)- 
factor. Now solving the MLA problem on the tree to within an 0(\og\ogk) factor to get a map Fj . K ^ \k\, 



and defining F{x) = Fjifix)) gives us an expected C?(log ^ log log ^)-approximation. We show in [Appendix C 
that this can be improved slightly to 0(log k) using a more direct approach. 



16 



B.3 Requirement Cut 



For requirement cut, [ pNR10 | already present an 0(log ^ log g)-approximation. 



B.4 Steiner Graph Bisection 

In this problem, we are given a value k' and want to find a bipartition (A, V \ A) of the graph such that \Ar\K\ = k' , 
and that minimizes the cost of edges cut by the bipartition. The approach of Racke, which embeds the graph 
G into a random tree and finds the best {k' ,k - k') bipartition on that, gives us an Oilogk) algorithm for this 
partitioning problem. 

B.5 Steiner ^-Multicut 

In this problem, we are given terminal pairs {sj, and a value k' < k, and we want to find a minimum cost 

set of edges whose deletion separates at least k' terminal pairs. Again, we can embed the graph into a random 



tree losing an O(logfc) factor, and use the theorem of Golovin et al. QGNSOq ] to get a 4/3 + e-approximation on 
this tree; this gives us the randomized 0(log ^)-approximation. 

B.6 Steiner Min-Cut Linear Arrangement 

The Steiner Min cut Linear Arrangement (SMCLA) problem is defined as follows: Given G = {V, E) and K QV 
with \K\ - k, we want to find a mapping F : V ^ \k\ such that F\k : A" is a bijection. The goal is 

to minimize rnax,- X^gf-i([,])^,^/r-i([,]) c.ry. For the non-Steiner version of the problem, Leighton and Rao [ LR99 ] 



show that given an cy-approximation to the balanced partitioning (or to the bisection) problem, one can get an 



C?(a log «)-approximation to the MCLA problem. Using [ |ARV09| |, this gives an 0(log''^ n)-approximation to the 
MCLA problem. 

We note that the reduction works immediately for the Steiner version of the problem: given an a-approximation 
to Steiner-bisection, one gets an 0{a log ^)-approximation to SMCLA. Thus we get an 0(log^ ^)-approximation 



to the SMCLA problem. We show in |Appendi?rC that this can be improved to C?(log^^ k) using a more direct 
approach. 



C Better Algorithms Using a Direct Approach 

The vertex sparsifiers give a modular approach to solving steiner version of various problems. Not surprisingly, 
for some of these problems, a direct attack will lead to better algorithms. In this section, we show that applying 
known techniques for Minimum Linear Arrangement (MLA) problem lead to a better approximation ratio for 
Steiner MLA, and for Steiner Minimum Cut Linear Arrangement. 

C.l Steiner Minimum Linear Arrangement 

Recall that the Steiner MLA problems is defined as follows. Given G = (V, E) and K Q V with \K\ = k, the 
goal is to find a mapping F : V ^ [k] such that F\k : K ^ [k] is a bijection. The goal is to minimize 
Ti(u,v)eECuv\Fiii) - F{v)\. Specifically, we show the following result: 

Theorem C.l There is a polynomial time 0(log k)-approximation algorithm for the SMLA problem based on the 
natural linear programming relaxation. 
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Proof: The linear program for the SMLA problem is based on the spreading metric linear programming relax- 
ation for MLA introduced in [ |ENRSO0| ]. 



mm 
subject to: 
(Triangle Inequality) 
(Spreading) 



X(n,v)e£' Cuvduv 



> 



Vu,v,w eV 

!^ VS QK,\S\>2,ueS 



> Vu,veV 



It follows from [|NRS0|] that the above is a valid linear programming relaxation to the SMLA problem, and 
that one can efficiently separate for the spreading constraints so that the LP can be solved in polynomial time 
using the Ellipsoid algorithm. Further, it is easy to check that the spreading constraints imply that for any u e K, 
|B(/(m, r) D K\ < 5r. (Here, Brf(v, r) = {w | d(v, w) < r] is the "ball" around v of radius r in the metric d.) 



Let J be a solution to the above linear program. Since d is a metric on V, it follows from Theorem 2.3 that we 
construct a (random) edge-weighted 2-HST T = (I U K, Ej) with internal nodes / and leaves K, and a retraction 
f:V^K such that 

(a) dT{f{x)J{y)) > d{x,y) for all x,y e K (with probability 1), 

(b) ET[dT(f{u),fiv))] < 0{logk) d{u,v) for all u,veV, 

We argue that given this HST, we can construct a mapping Ft '■ V ^ [k] such that Ft\k : K ^ [k]is a. bijection. 
This mapping will have the property that IFjiu) - Fjiv)] < 5dTifiu),f{v)). The approximation ratio of C?(log^) 
then follows from property (b) above. 

The mapping Ft is defined by taking the natural left-to-right ordering on K defined by T, and assigning every 
other vertex v e V to the position /(v). Formally, let ;r be a pre-order traversal of T. For every terminal x € K, 
set FTix) to the number of terminals in k that occur before x, i.e. FTix) = \K n [ni : / < n^^{x)]\. For every other 
vertex u e V, set FTiu) = FTifiu)). It is easy to check that Ft\k is a bijection. 

We next upper bound \FTiu) - fr(v)| for u,v € V. Consider the terminals f„ = f{u),tv = /(v); if f„ = ty, then 
FTix) = FT(y) and there is nothing to prove. Else let r„v be the smallest subtree of T containing and By the 
properties of the HST, we have (ir(?„, f,.) > <ir(?in z) for all z e Txy Moreover, dTiu, v) = <i7-(f„, ty). Now, 



\FTiu) - Ft(v)\ = \FT(tu) - FT(ty)\ 

<\Kn r„„| 

< \KnBd^{t,„dTitu,tv))\ 

< \KnBd{tu,dT{tu,U))\ 

< SdTitu, tv) 

= 5dT(u, v). 



(Since dTitu, tv) > dT{tu,z) for all z e T„y) 
(By property (a)) 
(By the spreading property) 



This proves Theorem C.l 



C.2 Steiner Min Cut Linear Arrangement 

Recall that the Steiner Min cut Linear Arrangement (SMCLA) problem is defined as follows. Given G = {V, E) 
and K c V with |^| = k, the goal is to find a mapping F : V ^ [k] such that F\k : ^ ^ [^] is a bijection. The 
goal is to minimize vaaxj YjxeF-'([i]),y<iF-'([i]) '^xy Specifically, we show the following result: 



Theorem C.2 There is a polynomial time C?(log' '^ k)-approximation algorithm for the SMCLA problem. 
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The algorithm and the proof are the natural generalization of the 0(log^^ n) approximation to the min cut linear 
arrangement problem. We sketch the argument here. 



This algorithm is based on an SDP formulation and the sparsest cut algorithm of [ARVOS], who show the 
following 

Theorem C.3 There exist a constant e > such that the following holds. For any k-point metric (S,d) 
satisfying Yjx.yeS dxy ^ there are sets A,B Q S such that \A\, \B\ > sk and d{A, B) > Moreover given 

vectors {v^ : e 5 ) representing d, such sets A, B can he found in polynomial time. 

Consider first the following linear program: 

niil^ I]Cv,y)e£: Cxydxy 

subject to: 

(Triangle Inequality) d^z - d„ - dy. < V.^;, y,z^V 

(Balance) I,x,yeKdxy > 'x" 

djcy > yx,yeV 



Let F : V ^ [k]he the optimum MCSLA with value OPT. Then the cut separating F"'([[|j]) from its comple- 
ment has value at most OPT, and gives a feasible integral solution to above linear program. Thus the value of the 
relaxation above is at most OPT. 

Suppose in the above linear program, we additionally require that the distance metric d be an metric, i.e. there 
exists vectors v^^ e such that d{x,y) = \\vji - Vy\\j. This program can be naturally written as an SDP, and 
can be solved in polynomial time to return vectors {vx). Moreover, the optimum to this relaxation has value at 



most OPT as well. Theorem C.3 then implies that we can find sets A,BcK such that \B\ > sk and where 



d{A,B) > A = — p=. Consider the sets A^ = {x e V : diA,x) < r}. For < r < A, it is immediate that 

A/log/: 



ACA,-QV\B. 

Picking r at random from (0, A), we observe that for any x,y e V 

Pr[x € Ar,y i. Ay\ < {d(y,A) - d{x,A))IA, 

so that by triangle inequality, the expected cost of the cut (A^, V \ Ar) is at most ^ Yi(x,y)eE Cxydxy < OPT /A. Thus 
we can find an r € (0, A) such that 

(a) \K n Ar\, \Kn{V\ Ar)\ < (1 - s)k. 

(b) i:xeA,.ytA,Cxy < 0{OPT ^\^). 

We can recursively compute steiner linear arrangements for Ar and V \ Ar, and by condition {a), the depth of the 
recursion is at most 0(}ogk). For any /, we can thus bound the total cost of edges from F^^{{i\) to V \ F~^{{i\). 
Indeed each level of the recursion contributes at most 0{OPT y]\og k) to this cost. Since there are at most 0(log k) 
levels, we get an C?(log^-^ k) approximation. 
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